SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Middeldorp, Jaap Michiel. 

(ii) TITLE OF INVENTION: Peptides and nucleic acid sequences related to the Epstein-Barr virus.

(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Organon Teknika Corporation 

Biotechnology Research Institute 

(B) STREET: 1330-A Piccard Drive 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850-4377 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP 92200721.6 

(B) FILING DATE: 13-MAR-1992 

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Bobrowicz, Donna 

(B) REGISTRATION NUMBER: 32,196 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Epstein-Barr virus



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CATGATGGCA CGCCGGCTGC CCAAGCCCAC CCTCCAGGGG AGGCTGGAGG CGGATTTTCC 60
AGACAGTCCC CTGCTTCCTA AATTTCAAGA GCTGAACCAG AATAATCTCC CCAATGATGT 120
TTTTCGGGAG GCTCAAAGAA GTTACCTGGT ATTTCTGACA TCCCAGTTCT GCTACGAAGA 180
GTACGTGCAG AGGACTTTTG GGGTGCCTCG GCGCCAACGC GCCATAGACA AGAGGCAGAG 240
AGCCAGTGTG GCTGGGGCTG GTGCTCATGC ACACCTTGGC GGGTCATCCG CCACCCCCGT 300
CCAGCAGGCT CAGGCCGCCG CATCCGCTGG GACCGGGGCC TTGGCATCAT CAGCGCCGTC 360
CACGGCCGTA GCCCAGTCCG CGACCCCCTC TGTTTCTTCA TCTATTAGCA GCCTCCGGGC 420
CGCGACTTCG GGGGCGACTG CCGCCGCCTC CGCCGCCGCA GCCGTCGATA CCGGGTCAGG 480
TGGCGGGGGA CAACCCCACG ACACCGCCCC ACGCGGGGCA CGTAAGAAAC AGTAGCCC 538

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 176 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Epstein-Barr virus

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:



Met Ala Arg Arg Leu Pro Lys Pro Thr Leu Gln Gly Arg Leu Glu Ala
1 5 10 15

Asp Phe Pro Asp Ser Pro Leu Leu Pro Lys Phe Gln Glu Leu Asn Gln
20 25 30

Asn Asn Leu Pro Asn Asp Val Phe Arg Glu Ala Gln Arg Ser Tyr Leu
35 40 45

Val Phe Leu Thr Ser Gln Phe Cys Tyr Glu Glu Tyr Val Gln Arg Thr
50 55 60

Phe Gly Val Pro Arg Arg Gln Arg Ala Ile Asp Lys Arg Gln Arg Ala
65 70 75 80

Ser Val Ala Gly Ala Gly Ala His Ala His Leu Gly Gly Ser Ser Ala
85 90 95

Thr Pro Val Gln Gln Ala Gln Ala Ala Ala Ser Ala Gly Thr Gly Ala
100 105 110

Leu Ala Ser Ser Ala Pro Ser Thr Ala Val Ala Gln Ser Ala Thr Pro
115 120 125

Ser Val Ser Ser Ser Ile Ser Ser Leu Arg Ala Ala Thr Ser Gly Ala
130 135 140

Thr Ala Ala Ala Ser Ala Ala Ala Ala Val Asp Thr Gly Ser Gly Gly
145 150 155 160

Gly Gly Gln Pro His Asp Thr Ala Pro Arg Gly Ala Arg Lys Lys Gln
165 170 175



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATGCTATCAG GTAACGCAGG AGAAGGAGCA ACAGCCTGCG GAGGTTCGGC CGCCGCGGGC 60
CAGGACCTCA TCAGCGTCCC CCGCAACACC TTTATGACAC TGCTTCAGAC CAACCTGGAC 120
AACAAACCGC CGAGGCAGAC CCCGCTACCC TACGCGGCCC CGCTGCCCCC CTTTTCCCAC 180
CAGGCAATAG CCACCGCGCC TTCCTACGGT CCTGGGGCCG GAGCGGTCGC CCCGGCCGGC 240
GGCTACTTTA CCTCCCCAGG AGGTTACTAC GCCGGGCCCG CGGGCGGGGA CCCGGGTGCC 300
TTCTTGGCGA TGGACGCTCA CACCTACCAC CCCCACCCAC ACCCCCCTCC GGCCTACTTT 360
GGCTTGCCGG GCCTCTTTGG CCCCCCTCCA CCCGTGCCTC CTTACTACGG ATCCCACTTG 420
CGGGCAGACT ACGTCCCCGC TCCCTCGCGA TCCAACAAGC GGAAAAGAGA CCCCGAGGAG 480
GATGAAGAAG GCGGGGGGCT ATTCCCGGGG GAGGACGCCA CCCTCTACCG CAAGGACATA 540
GCGGGCCTCT CCAAGAGTGT GAATGAGTTA CAGCACACGC TACAGGCCCT GCGCCGGGAG 600
ACGCTGTCCT ACGGCCACAC CGGAGTCGGA TACTGCCCCC AGCAGGGCCC CTGCTACACC 660
CACTCGGGGC CTTACGGATT TCAGCCTCAT CAAAGCTACG AAGTGCCCAG ATACGTCCCT 720
CATCCGCCCC CACCACCAAC TTCTCACCAG GCAGCTCAGG CGCAGCCTCC ACCCCCGGGC 780
ACACAGGCCC CCGAAGCCCA CTGTGTGGCC GAGTCCACGA TCCCTGAGGC GGGAGCAGCC 840
GGGAACTCTG GACCCCGGGA GGACACCAAC CCTCAGCAGC CCACCACCGA GGGCCACCAC 900
CGCGGAAAGA AACTGGTGCA GGCCTCTGCG TCCGGAGTGG CTCAGTCTAA GGAGCCCACC 960
ACCCCCAAGG CCAAGTCTGT GTCAGCCCAC CTCAAGTCCA TCTTTTGCGA GGAATTGCTG 1020
AATAAACGCG TGGCTTGA 1038

(2) INFORMATION FOR SEQ ID NO: 4:
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Ser Gly Asn Ala Gly Glu Gly Ala Thr Ala Cys Gly Gly Ser
1 5 10 15

Ala Ala Ala Gly Gln Asp Leu Ile Ser Val Pro Arg Asn Thr Phe Met
20 25 30

Thr Leu Leu Gln Thr Asn Leu Asp Asn Lys Pro Pro Arg Gln Thr Pro
35 40 45

Leu Pro Tyr Ala Ala Pro Leu Pro Pro Phe Ser His Gln Ala Ile Ala
50 55 60

Thr Ala Pro Ser Tyr Gly Pro Gly Ala Gly Ala Val Ala Pro Ala Gly
65 70 75 80

Gly Tyr Phe Thr Ser Pro Gly Gly Tyr Tyr Ala Gly Pro Ala Gly Gly
85 90 95

Asp Pro Gly Ala Phe Leu Ala Met Asp Ala His Thr Tyr His Pro His
100 105 110

Pro His Pro Pro Pro Ala Tyr Phe Gly Leu Pro Gly Leu Phe Gly Pro
115 120 125

Pro Pro Pro Val Pro Pro Tyr Tyr Gly Ser His Leu Arg Ala Asp Tyr
130 135 140

Val Pro Ala Pro Ser Arg Ser Asn Lys Arg Lys Arg Asp Pro Glu Glu
145 150 155 160

Asp Glu Glu Gly Gly Gly Leu Phe Pro Gly Glu Asp Ala Thr Leu Tyr
165 170 175

Arg Lys Asp Ile Ala Gly Leu Ser Lys Ser Val Asn Glu Leu Gln His
180 185 190

Thr Leu Gln Ala Leu Arg Arg Glu Thr Leu Ser Tyr Gly His Thr Gly
195 200 205

Val Gly Tyr Cys Pro Gln Gln Gly Pro Cys Tyr Thr His Ser Gly Pro
210 215 220

Tyr Gly Phe Gln Pro His Gln Ser Tyr Glu Val Pro Arg Tyr Val Pro
225 230 235 240

His Pro Pro Pro Pro Pro Thr Ser His Gln Ala Ala Gln Ala Gln Pro
245 250 255

Pro Pro Pro Gly Thr Gln Ala Pro Glu Ala His Cys Val Ala Glu Ser
260 265 270

Thr Ile Pro Glu Ala Gly Ala Ala Gly Asn Ser Gly Pro Arg Glu Asp
275 280 285

Thr Asn Pro Gln Gln Pro Thr Thr Glu Gly His His Arg Gly Lys Lys
290 295 300

Leu Val Gln Ala Ser Ala Ser Gly Val Ala Gln Ser Lys Glu Pro Thr
305 310 315 320

Thr Pro Lys Ala Lys Ser Val Ser Ala His Leu Lys Ser Ile Phe Cys
325 330 335

Glu Glu Leu Leu Asn Lys Arg Val Ala
340 345

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ala Val Asp Thr Gly Ser Gly Gly Gly Gly Gln Pro His Asp Thr Ala
5 10 15

Pro Arg Gly Ala Arg Lys Lys Gln
20

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Thr Ala Val Ala Gln Ser Ala Thr Pro Ser Val Ser Ser Ser Ile
5 10 15

Ser Ser Leu Arg Ala Ala Thr Ser Gly Ala Thr Ala Ala Ala
20 25

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Val Pro Arg Arg Gln Arg Ala Ile Asp Lys Arg Gln Arg Ala
5 10 15

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Gly Gln Pro His Asp Thr Ala Pro Arg Gly Ala Arg Lys Lys Gln
5 10 15

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Epstein-Barr virus



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Thr Ala Val Ala Gln Ser Ala Thr Pro Ser Val Ser
5 10

(2) INFORMATION FOR SEQ ID NO: 10:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Pro Ser Val Ser Ser Ser Ile Ser Ser Leu Arg Ala
5 10

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Ser Val Ser Ser Ser Ile Ser Ser Leu Arg Ala Ala
5 10



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Ser Ser Ile Ser Ser Leu Arg Ala Ala Thr Ser
5 10

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ser Ile Ser Ser Leu Arg Ala Ala Thr Ser Gly Ala
5 10



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Epstein-Barr virus



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ile Ser Ser Leu Arg Ala Ala Thr Ser Gly Ala Thr
5 10

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Arg Ala Ala Thr Ser Gly Ala Thr Ala Ala Ala Ser
5 10

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Ala Val Asp Thr Gly Ser Gly Gly Gly Gly Gln
5 10

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Epstein-Barr virus 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Ala Val Asp Thr Gly Ser Gly Gly Gly Gly Gln Pro
5 10

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Val Asp Thr Gly Ser Gly Gly Gly Gly Gln Pro His
5 10

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Asp Thr Gly Ser Gly Gly Gly Gly Gln Pro His Asp
5 10

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 






(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gly Gly Gly Gln Pro His Asp Thr Ala Pro Arg Gly
5 10

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gly Gln Pro His Asp Thr Ala Pro Arg Gly Ala Arg
5 10

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Epstein-Barr virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Gln Pro His Asp Thr Ala Pro Arg Gly Ala Arg Lys
5 10






